Statistical Inference for Streamed Longitudinal Data
Abstract
Modern longitudinal data, for example from wearable devices, may consist of measurements of biological signals on a fixed set of participants at a diverging number of time-points. Traditional statistical methods are not equipped to handle the computational burden of repeatedly analysing the cumulatively growing dataset each time new data are collected. We propose a new estimation and inference framework for dynamic updating of point estimates and their standard errors along sequentially collected datasets with dependence, both within and between the datasets. The key technique is a decomposition of the extended inference function vector of the quadratic inference function constructed over the cumulative longitudinal data into a sum of summary statistics over data batches. We show how this sum can be recursively updated without the need to access the whole dataset, resulting in a computationally efficient streaming procedure with minimal loss of statistical efficiency. We prove consistency and asymptotic normality of our streaming estimator as the number of data batches diverges, even as the number of independent participants remains fixed. Simulations demonstrate the advantages of our approach over traditional statistical methods that assume independence between data batches. Finally, we investigate the relationship between physical activity and several diseases through analysis of accelerometry data from the National Health and Nutrition Examination Survey.
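The recursive-update idea in the abstract can be illustrated with a much simpler stand-in. The sketch below is not the authors' quadratic inference function procedure: it uses ordinary least squares for a straight line, ignores the within-batch and between-batch dependence that the paper addresses, and produces no standard errors. The class name `StreamingLeastSquares` and its methods are hypothetical, chosen only for this illustration. What it does show is the shared mechanism: each batch is reduced to a few sums, the sums are added to running totals, and the estimate is recomputed from those totals without revisiting earlier data.

```python
from dataclasses import dataclass


@dataclass
class StreamingLeastSquares:
    """Running summary statistics for the model y = a + b * x + error.

    Hypothetical illustration only: plain least squares, no dependence
    modelling, no standard errors.
    """
    n: int = 0
    sx: float = 0.0
    sy: float = 0.0
    sxx: float = 0.0
    sxy: float = 0.0

    def update(self, xs, ys):
        """Fold one batch into the running sums; the batch can then be discarded."""
        for x, y in zip(xs, ys):
            self.n += 1
            self.sx += x
            self.sy += y
            self.sxx += x * x
            self.sxy += x * y

    def estimate(self):
        """Solve the 2x2 normal equations using only the accumulated sums."""
        det = self.n * self.sxx - self.sx ** 2
        if det == 0:
            raise ValueError("need at least two distinct x values")
        slope = (self.n * self.sxy - self.sx * self.sy) / det
        intercept = (self.sy - slope * self.sx) / self.n
        return intercept, slope


if __name__ == "__main__":
    # Three batches arriving one after another, generated from y = 1 + 2x.
    batches = [([0, 1], [1, 3]), ([2, 3], [5, 7]), ([4, 5], [9, 11])]
    model = StreamingLeastSquares()
    for xs, ys in batches:
        model.update(xs, ys)
        if model.n >= 2:
            print(model.n, model.estimate())
```

Because the normal equations depend on the data only through these five sums, the streamed estimate matches the one obtained by fitting all observations at once, while memory use stays constant as batches accumulate.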
Similar resources
statistical inference via empirical bayes approach for stationary and dynamic contingency tables
No abstract available.
A New Nonparametric Regression for Longitudinal Data
In many areas of medical research, an analysis of the relation between one response variable and some explanatory variables is desirable. Regression is the most common tool in this situation, and it can be used when assumptions such as normality of the response variable hold. In this paper we propose a nonparametric regression that does not require a normality assumption for the response variable, and we focus ...
Exploring New Statistical Methods for Causal Inference in Longitudinal Studies
This study proposes a novel method to provide unbiased effect estimates in the presence of time-dependent confounding and applies the method in an existing merged multi-source nursing home dataset to examine the effect of antipsychotic medication use on all cause mortality. Using standard methods, effect estimates of time-varying exposures from observational data will be biased in the presence ...
Simultaneous Statistical Inference for Epigenetic Data
Epigenetic research leads to complex data structures. Since parametric model assumptions for the distribution of epigenetic data are hard to verify we introduce in the present work a nonparametric statistical framework for two-group comparisons. Furthermore, epigenetic analyses are often performed at various genetic loci simultaneously. Hence, in order to be able to draw valid conclusions for s...
Statistical Inference for Large Scale Data
We introduce a sparse and positive definite estimator of the covariance matrix designed for high-dimensional situations in which the variables have a known ordering. Our estimator is the solution to a convex optimization problem that involves a hierarchical group lasso penalty. We show how it can be efficiently computed, compare it to other methods such as tapering by a fixed matrix, and develo...
Journal
Journal title: Biometrika
Year: 2023
ISSN: 0006-3444, 1464-3510
DOI: https://doi.org/10.1093/biomet/asad010